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Abstract 


Given  a  universal  relation  scheme,  presented  as  a  set  of  attributes  and  a  set  of  dependencies,  it 
may  be  advantageous  to  decompose  it  into  a  collection  of  schemes,  each  with  its  own  sets  of 
attributes  and  dependencies,  which  has  some  desired  properties.  A  basic  requirement  for  such 
a  decomposition  to  be  useful  is  that  the  corresponding  decomposition  map  on  universal  rela¬ 
tions  be  injective.  A  central  problem  in  database  theory  is  to  find  the  reconstruction  map,  i.e., 
the  inverse  map  of  an  injective  decomposition  map.  We  prove  here  that  when  the  decomposi¬ 
tion,  viewed  as  a  hypergraph,  is  acyclic  and  the  given  dependencies  arc  full  implicalional  depen¬ 
dencies,  then  the  reconstruction  map  is  the  natural  join.  Based  on  this,  we  show  that  there  is  a 


polynomial  time  algorithm  to  test  for  injectiveness  of  decompositions 
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1.  Introduction 

A  significant  portion  of  research  on  relational  database  theory  has  been  concerned  with 
the  properties  of  database  decompositions.  The  generic  problem  can  be  described  as  follows. 
Given  a  “universal”  scheme  presented  as  a  set  of  attributes  and  a  set  of  dependencies,  what  are 
the  conditions  under  which  it  can  be  decomposed  into  a  collection  of  schemes,  each  with  its 
own  sets  of  attributes  and  dependencies,  having  some  desired  properties.  The  properties  con¬ 
sidered  were,  at  first,  various  normal  forms,  (sec  (Ma,  Ul]).  It  was  then  realized,  however,  that 
there  is  a  more  fundamental  property,  called  faithfulness,  that  has  to  be  satisfied  by  decomposi¬ 
tions  [R1,BR,MMSU].  Intuitively,  a  decomposition  is  faithful  if  the  relations  corresponding  to 
the  different  schemes  can  be  updated  separately. 

A  basic  assumption  underlying  these  ideas  is  that  when  a  universal  scheme  is  decomposed 
into  smaller  schemes,  each  of  the  universal  relations  associated  with  it  is  decomposed  into 
smaller  relations  using  the  projection  operation,  i.e.,  each  such  relation  is  projected  onto  each 
one  of  the  smaller  schemes.  For  a  decomposition  to  be  faithful,  we  must  not  lose  any  informa¬ 
tion  by  decomposing  the  universal  relations,  that  is  the  decomposition  map  must  be  injective. 
In  other  words,  it  should  be  possible  to  reconstruct  the  universal  relations  from  their  projec¬ 
tions.  The  desirability  of  injectiveness  is  called  in  [BBC]  the  representation  principle. 

The  question  raised  now  is  as  follows:  Given  that  a  decomposition  is  faithful,  what  is  the 
resulting  reconstruction  map?  The  most  natural  choice  for  this  map  is  the  (natural)  join  opera¬ 
tion.  The  problem  is  whether  indeed  the  reconstruction  map  is  the  join.  This  problem  was 
first  presented  in  [Rl],  where  it  is  answered  affirmatively  for  the  case  that  only  functional 
dependencies  arc  given  and  the  decomposition  is  into  two  schemes  only.  This  result  was  gen¬ 
eralized  in  [BR,  MMSU],  where  the  restriction  on  the  number  of  schemes  was  removed.  It  was 
generalized  further  in  [V2J  to  the  case  where  full  implicational  dependencies  are  given.  In  con¬ 
trast,  it  is  shown  there  that  if  we  allow  arbitrary  first-order  sentences  instead  of  full  implica¬ 
tional  dependencies,  then  the  reconstruction  map  docs  not  have  to  be  the  join. 
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The  result  in  [V2]  assumes  that  both  finite  and  infinite  databases  are  considered.  It  is 
more  realistic*  however,  to  assume  that  database  arc  inherently  finite.  TTiis  is  the  case  that  we 
consider  in  this  paper.  While  we  cannot  show  that  the  reconstruction  map  is  the  join,  we  prove 
it  for  the  important  case  that  the  decomposition  is  acyclic  [FMU,  BFMY].  Furthermore,  in  this 
case  the  condition  of  injectiveness  is  equivalent  to  the  condition  that  the  reconstruction  map  be 
the  join.  (The  same  result  was  shown  independently  in  [CP]  for  the  less  general  case  where 
functional  and  join  dependencies  arc  given  and  and  the  decomposition  is  into  two  schemes.)  In 
contrast,  it  is  shown  in  [V2]  that  for  cyclic  decomposition  injectiveness  does  not  imply  that  the 
reconstruction  map  is  the  join.  Finally,  based  on  this  characterization  of  injectiveness,  we  show 
that  there  is  a  polynomial  time  algorithm  to  lest  for  injectiveness  of  decompositions. 

1  Basic  Definitions 

2.1.  Relation  and  Dependencies 

We  assume  familiarity  with  the  terminology  and  concepts  of  relational  database  theory  as 
presented  in  [Ma,  Ul].  We  use  l[X]  to  denote  the  projection  of  the  relation  /  on  the  attribute 
set  Xy  and  *lj  to  denote  the  join  of  the  set  {/,}  of  relations.  We  assume  that  the  relations  we 

are  dealing  with  and,  accordingly,  the  dependencies  that  refer  to  them  are  typed  that  is,  distinct 

attributes  have  disjoint  domains.  We  also  assume  that  all  relations  are  finite.  The  universal 

relation  scheme  is  denoted  t/,  and  a  database  scheme  is  a  set  R  =  {R\ . /?*}  of  distinct  rcla- 

* 

tion  scheme  such  that  [J  R,  =  U. 

/= l 

The  dependencies  wc  use  here  are  the  full  implicational  dependencies  (fid’s)  [BV,  F2J,  i.e., 
equality-generating  dependencies  (cgd’s)  and  full  tuple-generating  dependencies  (ftgd’s).  We 
denote  the  class  of  relations  that  satisfy  a  set  1  of  dependencies  by  S/f7X2),  and  wc  denote 
logical  implication  by  |=. 
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The  class  of  join  dependencies  [ABU,R2]  is  a  subclass  of  the  class  of  ftgd’s.  They  were 
originally  introduced  using  the  following  notation.  A  join  dependency  (jd)  is  a  statement  *[R], 
where  R  is  a  database  scheme  [R\ . /?*}.  It  is  satisfied  by  a  relation  /  on  U  if 

k 

I  ^/ [/?,  ].  Let  R  and  S  be  database  schemes.  We  say  that  S  coven  R  if  for  all  R  in  R  there 

is  some  S  in  S  such  that  RCS.  It  is  known  [BMSU]  that  ifS  covers  R  then  *[R]|=*[S].  A 
multivalued  dependency  (mvd)  [FI,  Za]  is  essentially  a  “binary”  jd,  i.e.,  it  is  a  jd  1,^2!- 
(Note  that  R lU/?2=  U).  This  notation  differs  from  the  standard  arrow  notations  for  mvd’s. 

Let  R  =  {Ri, . . . ,  Rk}  be  a  database  scheme.  If  R]  and  R2  are  subsets  of  R,  then  Ri,R2 
is  a  partition  of  R  if  RinR2  =  0  and  R  =  RjUR2.  A  database  scheme  R  is  said  to  be  acyclic 
[FMU,BFMY]  if  the  jd  *{R]  is  logically  equivalent  to  the  set  of  mvd’s 

{*[{J  Ri.U  *2)  •  Ri»R2  is  a  partition  of  R}. 

Note  that  one  direction  of  the  equivalence  is  true  for  any  database  scheme. 

We  will  use  here  two  properties  of  fids  that  are  shown  in  [VI, V3]. 

(1)  Let  X  be  a  set  of  fid's  and  let  r  be  an  mvd.  If  Z  ^  t  then  there  is  a  relation  /  such  that 
|  / 1  =2,  /  satisfies  Z,  and  /  does  not  satisfy  t. 

(2)  There  is  a  quadratic  time  algorithm  to  test  for  a  given  finite  set  Z  of  fid’s  and  an  mvd  r 
whether  Z  | =r. 

II  Decompositions 

A  database  on  the  database  R  =  {Rlf . . . ,  Rk}  is  a  collection  {/ 1.  of  relations  on 

/?! . Rk ,  correspondingly.  With  each  database  scheme  R  wc  associate  a  decomposition  map 

AR.  Given  a  relation  /  on  l/,  AR  applied  to  /  yields  a  database  on  R: 

A  decomposition  map  Ar  is  injective  with  respect  to  a  set  X  of  dependencies  if  Ar  is  injective 
on  SATCS.).  That  is,  if  /  and  J  are  two  distinct  relations  in  SATVL),  then  Ar(/)^Ar(J).  If 


/ 
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AR  is  injective  with  respect  to  2  then  it  has  an  inverse  map 

p„:{AR(/):  KSAT(3:)}-+SAT(n 

such  that  pR(AR(/))=/  for  all  /  in  SAT(l).  pR  is  called  the  reconstruction  map.  If 
* 

pR(/i . /*)=  *  Ij  then  we  say  that  the  reconstruction  map  is  the  join.  Another  map  asso- 

7*1 

ciated  with  a  database  scheme  R  is  the  project- join  map  mR  defined  by 

»»«(/>=  .*/[*/! 

y  =  l 

Note  that  a  relation  /  satisfies  *[R]  exactly  when  wR(/)=/. 

3.  The  Main  Result 

Let  /  be  a  relation  on  U.  A  permutation  on  Ms  an  injective  map  a  from  the  set  of 
values  in  /  into  itself  such  that  the  set  of  values  for  each  attribute  is  mapped  into  itself.  A  per¬ 
mutation  on  I  is  essentially  a  vector  of  permutations,  one  for  each  attribute  of  U.  We  denote 
by  a(/)  the  relation  obtained  by  replacing  simultaneously  each  value  in  /  by  its  image  under 
a. 


Lemma  1.  [BV]  Let  /  be  a  relation  on  U  and  let  a  be  a  permutation  on  /.  Then  I  and  a(/) 
satisfy  exactly  the  same  fid’s.  ■ 

Obviously,  I  and  a(f )  have  the  same  projection  on  each  singleton  attribute  set  {A}. 
They  need  not  have,  however,  the  same  projections  on  larger  attribute  sets.  A  permutation  a 
on  a  relation  /  preserves  a  database  scheme  R  if  AR(/)= AR(ot(/)). 

Lemma  2.  Let  R  =  {R\,R2}  be  a  database  scheme,  and  let  /={w1,w2}  be  a  relation  on  V . 
For  every  tuple  >v  in  mR(/)  there  exists  a  permutation  a  on  /  that  preserves  R  such  that  w  is 
in  a(7). 

Proof.  If  w  is  in  /  then  take  a  to  be  the  identity  permutation,  so  we  can  assume  that  w  is  not 
in  /.  If  however  either  R\CR2  or  R2CR i,  then  wR(/)=/,  so  assume  that  the  two  sets  are 
incomparable.  We  can  also  assume  that  and  >vi[R2\^w2[R2i  otherwise 


wr(/)=  /.  Finally,  h^xI/? iD/f  J  =  w2[RiC\R J,  since  otherwise  mH(/)=/. 

Let  and  ^[^2J=  Define  a  to  be  the  identity  on  each  attribute  in 

ftj.  For  an  attribute  in  /f  2,  a  exchanges  the  values  *vi(/l]  and  v>'i[A  ].  a  is  well  defined,  since 
H'i[/Jin/t2]  =  Now  we  have  that  a(>vi)[/?J  =  >V)[fi1]  and  a(»’i)[/?2)=  *’2(^2].  so 

w  =  a(wi)€a(I).  It  is  also  easy  to  see  that  a  preserves  R.  ■ 

The  relationship  between  covering  and  preservation  is  pointed  to  in  the  following  easy 
lemma. 

Lemma  3.  Let  R  and  S  be  database  schemes  such  that  S  covers  R.  If  a  is  a  permutation  on  a 
relation  I  that  preserves  S,  then  it  also  preserves  R.  ■ 

We  can  now  prove  our  main  result. 

Theorem  1.  Let  2  be  a  set  of  fid’s,  and  let  R  be  an  acyclic  database  scheme.  The  following 
three  conditions  are  equivalent: 

(1)  Ar  is  injective  with  respect  to  2. 

(2)  2N'[R]. 

(3)  pr  is  the  join. 

Proof. 

(1  ->  2):  Suppose  that  AR  is  injective  with  respect  to  2  but  2  ^*[R].  Since  R  is  acyclic,  there 
is  a  partition  Ri.Rj  of  R  such  that  2  where  S= {(J  Rj,(J  R2}.  By  property  (1)  of  fid’s, 
there  is  a  relation  /  =  {*vi,wj)  such  that  /  satisfies  2  but  I  docs  not  satisfy  *(Sj.  Since  /  does 
not  satisfy  S,  there  is  a  tuple  *v  that  is  in  m^I)  but  not  in  /.  By  Lemma  2,  there  is  a  permuta¬ 
tion  a  on  I  such  that  >v  is  in  a(l),  and  a  preserves  S.  But  then  we  must  have  /  ^  a(/),  since 
tv  is  not  in  /,  and  by  Lemma  1, /  and  a(l)  both  satisfy  2.  We  also  have  that  a  preserves  R, 
because  S  covers  R.  Therefore,  AR(/)=A|t(a(/))  -  in  contradiction  to  the  injcctivencss  of  AR 
with  respect  to  2. 

(2  ->  3):  Let  /  be  a  relation  in  SAT( 2).  Since  2 1=  *[R],  /  =mR(/).  That  is, 
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/  =  }  ![*,]= 

1=1 

where  pR  is  the  join. 

(3  ->  1):  If  pR  exists,  then  AR  must  be  injective  with  respect  to  X.  ■ 

We  now  show  that  the  condition  of  Theorem  1  can  be  tested  efficiently. 

Theorem  2.  There  is  cubic  time  algorithm  to  test  for  a  given  finite  set  2  of  fid's  and  an  acyclic 
database  scheme  R  whether  AR  is  injective  with  respect  to  2. 

Proof.  By  Theorem  1  it  suffices  to  test  whether  2  *[R].  Our  strategy  is  to  construct  first  a 

set  T  of  mvd’s  that  is  logically  equivalent  to  *[R]  (by  acyclicity),  and  then  to  test  if  2  }=r  for 
each  r  in  T.  By  property  (2)  of  fid's,,  each  of  the  latter  tests  can  be  done  in  quadratic  time. 
The  crux  of  the  proof  is  showing  how  to  construct  T  such  that  |  T  |  <  |  U  |  and  the  construction 
can  be  done  in  quadratic  time. 

If  R  is  acyclic  then  |R|  <U  and  there  is  a  set  T  of  mvd’s  that  is  logically  equivalent  to 
*[R]  such  that  |  T  |  <  |  R  |  [BFMYJ.  In  order  to  construct  V  we  have  to  construct  first  a  join 
forest  for  R.1  This  join  forest  can  be  constructed  in  time  linear  in  the  size  of  R  [TY].  Then 
every  mvd  in  V  can  be  constructed  from  the  join  forest  in  time  linear  in  the  size  of  R  [BFMY]. 
Since  |  T  |  <  |  U  | ,  the  claim  follows.  ■ 

4,  Concluding  Remarks 

We  have  shown  that  the  reconstruction  map  is  the  join  for  quite  a  general  situation, 
namely  when  the  dependencies  arc  fid's  and  the  decomposition  is  acyclic.  We  note  that  most 
classes  of  dependencies  treated  in  the  literature  are  special  cases  of  fid’s.  An  exception  is  the 
class  of  inclusion  dependencies  [CFP],  Our  results  can  be  generalized  to  deal  also  with  inclu¬ 
sion  dependencies  [KCV].  As  for  the  restriction  that  the  decomposition  be  acyclic  there  are 
arguments  to  the  effect  that  most  real  life  situations  can  be  captured  by  such  decompositions. 

1  A  join  forest  for  R  is  a  labeled  acyclic  graph  with  the  elements  of  R  as  nodes,  an  edge  connecting  Rt  to  Rj  is 


Wc  have  also  shown  how  to  test  efficiently  for  injcctivencss.  We  mentioned  that  there  is 
another  desirable  property  of  decompositions,  called  surjectiveness  [V2].  Faithful  decomposi¬ 
tions  are  both  injective  and  surjective.  When  only  functional  dependencies  are  given  there  is  a 
polynomial  time  test  for  faithful  [BH,  BR,  MMSU].  We  do  not  know  of  any  effective  test 
when  fid's  are  given,  even  when  the  decomposition  is  acyclic. 
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